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Polvketides and their Synthesis 

The present invention relates to processes and 
materials (including enzyme systems, nucleic acids, 
vectors and cultures) for preparing novel polyketides, 
5 particularly 12-, 14- and 16-membered ring macrolides, by 

recombinant synthesis and to the novel polyketides so 
produced. Polyketide biosynthetic genes or portions of 
them, which may be derived from different polyketide 
biosynthetic gene clusters are manipulated to allow the 
10 production of specific novel polyketides, such as 12-, 

14- and 16-membered macrolides, of predicted structure. 
The invention is particularly concerned with the 
replacement of genetic material encoding the natural 
starter unit with other genes in order to prepare 
15 macrolides with preferentially an acetate starter unit; 

or preferentially a propionate unit; or preferentially 
with an unusual starter unit, in each case minimising the 
formation of by-products containing a different starter 
unit . 

20 Polyketides are a large and structurally diverse 

class of natural products that includes many compounds 
possessing antibiotic or other pharmacological 
properties, such as erythromycin, tetracyclines, 
rapamycin, avermectin, monensin, epothilones and FK506. 



10 
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In particular, polyketides are abundantly produced by 
Streptomyces and related a c t inomyce t e bacteria. They are 
synthesised by the repeated stepwise condensation of 
acylthioes ters in a manner analogous to that of fatty 
acid biosynthesis. The greater structural diversity 
found among natural polyketides arises from the selection 
of (usually) acetate or propionate as A starter or 
"extender" units; and from the differing degree of 
processing of the -keto group observed after each 
condensation- Examples of processing steps include 
reduction to (3 -hydroxyacyl- , reduction followed by 
dehydration to 2-enoyl-, and complete reduction to the 
saturated acylthioes ter . The stereochemical outcome of 
these processing steps is also specified for each cycle 
of chain extension. 
15 The biosynthesis of polyketides is initiated by a 

group of chain-forming enzymes known as polyketide 
synthases. Two classes of polyketide synthase ( PKS ) have 
been described in act inomycet es . One class, named Type I 
PKSs, represented by the PKSs for the macrolides 
erythromycin, oleandomycin, avermectin and rapamycin, 
consists of a different set or "module'"'' of enzymes for 
each cycle of polyketide chain extension. For an example 
see Figure 1 (Cortes, J. et al. Nature (1990) 348:176- 
178; Donadio, S. et al. Science (1991) 2 523:675-679; 
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Swan, D.G. et al. Mol . Gen. Genet. (1994) 242:358-362; 
MacNeil, D. J. et al. Gene (1992) 115:119-125; Schwecke, 
T. et al. Proc. Natl. Acad. Sci. USA (1995) 92:7839- 
7843) . 

The term "extension module" as used herein refers to 



the set of contiguous domains, from a p-ketoacyl-ACP 
synthase PAKS") domain to the next acyl carrier protein 
("ACP") domain, which accomplishes one cycle of 
polyketide chain extension. The term "loading module" is 
used to refer to any group of contiguous domains which 
accomplishes the loading of the starter unit onto the PKS 
and thus renders it available to the KS domain of the 
first extension module. The length of polyketide formed 
has been altered, in the case of erythromycin 
biosynthesis, by specific relocation using genetic 
15 engineering of the enzymatic domain of the erythromycin- 

producing PKS that contains the chain releasing 
thioesterase/cyclase activity (Cortes et al. Science 
(1995) 268:1487-1489; Kao, CM. et al. J. Am. Chem. Soc . 
(1995) 117 : 9105-9106) . 

In-frame deletion of the DNA encoding part of the 
ketoreductase domain in module 5 of the erythromycin- 
producing PKS (also known as 6-deoxyerythronolide B 
synthase, DEBS) has been shown to lead to the formation 
of erythromycin analogues 5 , 6-dideoxy-3-a-mycarosyl-5- 
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oxoerythronolide B, 5, 6-dideoxy-5-oxoerythronolide B and 

5.6- dideoxy, 6 (3-epoxy-5-oxoerythronolide B (Donadio, S. 
et al. Science (1991) 252:675-67 9). Likewise, alteration 
of active site residues in the enoyl reductase domain of 
module 4 in DEBS, by genetic engineering of the 

5 corresponding PKS-encoding DNA and its introduction into 

Saccharopolyspora erythraea , led to the production of 

6 . 7- anhydroerythromycin C (Donadio, S. et al . Proc Natl. 
Acad. Sci. USA (1993) 90:7119-7123). 

International Patent Application number WO 93/13663 
10 describes additional types of genetic manipulation of the 

DEBS genes that are capable of producing altered 
polyketides. However many such attempts are reported to 
have been unproductive (Hutchinson, C. R. and Fujii, I. 
Annu. Rev. Microbiol. (1995) 49:201-238, at p. 231). The 
15 complete DNA sequence of the genes from Streptomyces 

hygroscopicus that encode the modular Type I PKS 
governing the biosynthesis of the macrocyclic 
immunosuppressant polyketide rapamycin has been disclosed 
(Schwecke, T. et al. (1995) Proc. Natl. Acad. Sci. USA 
20 92:7839-7843). The DNA sequence is deposited in the 

EMBL/Genbank Database under the accession number X86780. 

The second class of PKS, named Type II PKSs, is 
represented by the synthases for aromatic compounds. 
Type II PKSs contain only a single set of enzymatic 
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for chain extension and these are re-used as 
appropriate in successive cycles (Bibb, M. J. et al. EMBO 
J. (1989) 8:2727-2736; Sherman, D. H. et al. EMBO J. 
(1989) 8:2717-2725; Fernandez-Moreno, M . A. et al . J . 
Biol. Chem. (1992) 2 67:19278-19290). The "extender" 



5 units for the Type II pKSs are usually acetate units, and 

the presence of specific cyclases dictates the preferred 
pathway for cyclisation of the completed chain into an 
aromatic product (Hutchinson, C. R. and Fujii, I- Annu . 
Rev. Microbiol. (1995) 49:201-238). Hybrid polyketides 

10 have been obtained by the introduction of clones Type II 

PKS gene-containing DNA into another strain containing a 
different Type II PKS gene cluster, for example by 
introduction of DNA derived from the gene cluster for 
actinorhodin, a blue-pigmented polyketide from 

15 Streptomyces coellcolor , into an anthraquinone 

polyketide-producing strain of Streptomyces gallleus 
(Bartel, P. L . et al. J. Bacterid . (1990) 172:4816- 
4826) . 

The minimal number of domains required for 
20 polyketide chain extension on a Type II PKS when 

expressed in a Streptomyces coellcolor host cell (the 
"minimal PKS") has been defined for example in 
International Patent Application Number WO 95/08548 as 
containing the following three polypeptides which are 
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products of the act I genes: first P. S ; secondly a 
polypeptide termed the CLF with end-to-end amino acid 
sequence similarity to the KS but in which the essential 
active site residue of the KS , namely a cysteine residue, 
is substituted either by a glutamine residue, or in the 
5 case of the PKS for a spore pigment such as the whiE gene 

product (Chater, K. F. and Davis, N. K. Mol . Microbiol. 
(1990) 4:1679-1691) by a glutamic acid residue (Figure 
2); and finally an ACP. The CLF has been stated for 
example in International Patent Application Number WO 

10 95/08548 to be a factor that determines the chain length 

of the polyketide chain that is produced by the minimal 
PP:S . However it has been found (Shen, B. et al. J. Am. 
Chem. Soc. (1995) 117:6811-6821) that when the CLF for 
the octaketide actinorhodin is used to replace the CLF 

15 for the decaketide tetracenomycin in host cells of 

Streptomyces glaucescens , the polyketide product is not 
found to be altered from a decaketide to an octaketide, 
so the exact role of the CLF remains unclear. An 
alternative nomenclature has been proposed in which KS is 

20 designated KSa and CLF is designated KSp, to reflect this 

lack of knowledge (Meurer, G. et al. Chemistry and 
Biology (1997) 4:433-443). The mechanism by which 
acetate starter units and acetate extender units are 
loaded onto the Type II PKS is not known, but it is 
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speculated that the malonyl -CoA : ACP acyltransf erase of 
the fatty acid synthase of the host cell can fulfil the 
same function for the Type II PKS (Revill, W. P. et al . 
J. Bacterid. (1995) 17 7:3946-3952). 

International Patent Application Number WO 95/08548 
describes the replacement of actinorhodin PKS genes by 
heterologous DNA from other Type II PKS gene clusters, to 
obtain hybrid polyketides. The same International Patent 
Application WO 95/08548 describes the construction of a 
strain of Streptomyces coelicolor which substantially 
lacks the native gene cluster for actinorhodin, and the 
use in that strain of a plasmid vector pRM5 derived from 
the low-copy number vector SCP2* isolated from 
Streptomyces coelicolor (Bibb, M. J. and Hopwood, D. A. 
J. Gen. Microbiol. (1981) 126:427) and in which 
heterologous PKS-encoding DNA may be expressed under the 
control of the divergent act 1/ act III promoter region 
of the actinorhodin gene cluster (Fernandez-Moreno, M . A . 
et al. J. Biol. Chem. (1992) 2 67:19278-19290). The 
plasmid pRM5 also contains DNA from the actinorhodin 
biosynthetic gene cluster encoding the gene for a 
specific activator protein, ActII-orf4. The Act II-orf4 
protein is required for transcription of the genes placed 
under the control of the actl/ act II bidirectional 
promoter and activates gene expression during the 



transition from growth to stationary phase in the 
vegetative mycelium (Hallam, S. E. et al. Gene (1988) 
7 4 : 305-320) . 

Type II clusters in Streptomyces are known to be 
activated by pathway- speci f ic activator genes (Narva, K. 
5 E. and Feitelson, J. S. J. Bacteriol. (1990) 172:326-333; 

Stut zman-Engwall , K. J. et al . J. Bacteriol. (1992) 
174:144-154; Fernandez-Moreno, M.A. et al. Cell (1991) 
66:769-780; Takano, E. et al. Mol . Microbiol. (1992) 
6:2797-2804; Takano, E. et al . Mol. Microbiol. (1992) 

10 7:837-845), The DnrI gene product complements a mutation 

in the actll-orf 4 gene of 5. coel icolor , implying that 
DnrI and ActII-orf4 proteins act on similar targets. A 
gene (srmR) has been described (EP 0 524 832 A2 ) that is 
located near the Type I PKS gene cluster for the 

15 macrolide polyketide spiramycin. This gene specifically 

activates the production of the macrolide antibiotic 
spiramycin, but no other exampples have been found of 
such a gene. Also, no homologues of the Actll- 
or f 4 /DnrI /RedD family of activators have been described 

20 that act on Type I PKS genes. 

Although large numbers of therapeutically important 
polyketides have been identified, there remains a need to 
obtain novel polyketides that have enhanced properties or 
possess completely novel bioactivity. The complex 
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polyketides produced by Type I PKSs are particularly 
valuable, in that they include compounds with known 
utility as antihelminthics , insecticides, 

immunosuppressants, antifungal or antibacterial agents. 
Because of their structural complexity, such novel 



polyketides are not readily obtainable by total chemical 
synthesis, or by chemical modifications of known 
polyketides . 

There is also a need to develop reliable and 
specific ways of deploying individual modules in practice 
so that all, or a large fraction, of hybrid PKS genes 
that are constructed, are viable and produce the desired 
polyketide product. 

Pending International Patent Application number 
PCT/GB97/01818 discloses that a PKS gene assembly 
15 (particularly of Type I) encodes a loading module which 

is followed by at least one extension module. Thus Figure 
1 shows the organisation of the DEBS genes. The first 
open reading frame encodes the first multi-enzyme or 
cassette ( DEBS 1) which consists of three modules: the 
loading module (ery-load) and two extension modules 
(modules 1 and 2) . The loading module comprises an 
acyltransferase and an acyl carrier protein. This may be 
contrasted with Fig. 1 of WO 93/13663 (referred to 
above) . This shows ORF1 to consist of only two modules, 
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the first of which is in fact both the loading module and 
the first extension module. 

PCT/GB97/01818 describes in general terms the 
production of a hybrid PKS gene assembly comprising a 
loading module and at least one extension module. 
5 PCT/GB97/01818 also describes (see also Marsden, A. F. 

A. et al. Science (1998) 279:199-202) construction of a 
hybrid PKS gene assembly by grafting the wide-specificity 
loading module for the a vermect in-producing polyketide 
synthase onto the first multienzyme component (DEBS 1) 
10 for the erythromycin PKS in place of the normal loading 

module. Certain novel polyketides can be prepared using 
the hybrid PKS gene assembly, as described for example in 
pending International Patent Application number 
(PCT/GB97/01810) . Patent Application PCT /GB 97 / 0 1 8 1 8 
15 further describes the construction of a hybrid PKS gene 

assembly by grafting the loading module for the 
rapamycin-producing polyketide synthase onto the first 
multienzyme component (DEBS 1) for the erythromycin PKS 
in place of the normal loading module. The loading 
20 module of the rapamycin PKS differs from the loading 

modules of DEBS and the avermectin PKS in that it 
comprises a CoA ligase domain, an enoylreduct ase ("ER") 
domain and an ACP, so that suitable organic acids 
including the natural starter unit 3,4- 
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dihydroxycyclohexane carboxylic acid may be activated m 
situ on the PKS loading domain, and with or without 
reduction by the ER domain transferred to the ACP for 
intramolecular loading of the KS of extension module 1 
(Schwecke, T. et al. Proc. Natl. Acad. Sci . USA (1995) 



92 :7839-7843) . 

The DNA sequences have been disclosed for several 
Type I PKS gene clusters that govern the production of 
16-membered macrolide polyketides, including the tylosin 
PKS from Streptomyces fradiae (EP 0 791 655 A2 ) , the 
niddamycin PKS from Streptomyces caelestis (Kavakas, S. 
J. et al. J. Bacteriol. (1998) 17 9:7515-7522) and the 
spiramycin PKS from Streptomyces ambofaciens (EP 07 91 655 
A2) . All of these gene sequences have in common that 
they show the loading module of the PKS to differ from 
the loading module of DEBS and of the avermectin PKS in 
that they consist of a domain resembling the KS domains 
of the extension modules, an AT domain and an ACP (Figure 
3) . The additional N-terminal KS-like domain has been 
named KSq because it differs in each case from an 
extension KS by the specific replacement of the active 
site cysteine residue essential for p-ketoacyl-ACP 
synthase activity by a glutamine (Q in single letter 
notation) residue. The function of the KSq domain is 
unknown (Kavakas, S. J. et al. J. Bacteriol. (1998) 
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179:7515-7522), but its presence in these PKSs for 16- 
membered macrolides is surprising because the starter 
units of tylosin, niddamycin and spiramycin appear to be 
propionate, acetate and acetate respectively, that is, 
the same type of starter unit as in DEBS. The AT 
5 adjacent to the KSq domain is named here the ATq domain. 

When the entire loading module of the tylosin PKS 
was used to replace the analogous loading module in the 
spiramycin PKS in S. ambofaciens (Kuhstoss et al . Gene 
(1996) 183:231-236), the nature of the starting unit was 

10 stated to be altered from acetate to propionate. Since 

the role of the KSq domain was not understood, no 
specific disclosure was made that revealed either the 
importance of the KSq domain, or the possible utility of 
these KSq-containing loading modules in ensuring the 

15 purity of the polyketide product in respect of the 

starter unit, even at high levels of macrolide 
production. The interpretation for their results was 
stated as: "Therefore we believe that the experiments 
described here provide strong experimental support for 

20 the hypothesis that the AT domains in Type I PKS systems 

select the appropriate substrate at each step in 
synthesis" (Kuhstoss et al. Gene (1996) 183:231-236, at 
p. 235). These authors noted the analogy with the CLF 
protein in Type II PKS systems and that the latter 
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protein is thought to be involved in determining the 
chain length. They state: "KSq may serve a similar 
function, although it is unclear why such a function 
would be necessary in the synthesis of these 16-membered 
polyketides when it is not needed for the synthesis of 



5 other complex polyketides such as 6-DEB or rapamycin. In 

any case the KSq is unlikely to be involved in substrate 
choice at each step of synthesis." (Kuhstoss et al. Gene 
(1996) 183 :231-236) . 

It has been shown that when genetic engineering is 

10 used to remove the loading module of DEBS, the resulting 

truncated DEBS in S. erythraea continues to produce low 
levels of erythromycins containing a propionate starter 
unit (Pereda, A. et al. Microbiology (1995) 144:543-553) 
The same publication shows that when in this truncated 

15 DEBS the methylmalonyl-CoA -specific AT of extension 

module 1 was replaced by a malonyl-CoA-specif ic AT from 
an extension module of the rapamycin PKS, the products 
were also low levels of erythromycins containing a 
propionate starter unit, demonstrating that the origin of 

20 the starter units is not decarboxylation of the 

(methyl) malonyl groups loaded onto the enzyme by the AT 
of module 1, but from direct acylation of the KS of 
extension module 1 by propionyl-CoA . This is in contrast 
to a previous report, using partially purified DEBS1+TE, 
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a truncated bimodular PKS derived from DEBS (Kao, C. M . 
et al. J. Am. Chem. Soc. (1995) 117:9105-9106) and 
functionally equivalent to DEBS 1 -TE (Brown, M. J. B. et 
al., J. Chem. Soc. Chem. Commun . (1995) 1517-1518; 

Cortes, J. et al. Science (1991) 2 523:675-679), which 

5 stated that the origins of the starter units for DEBS can 

include methylmalona te units which are loaded onto module 
1 and are decarboxylated by the KS of module 1 (Pieper, 
R. et al. Biochemistry (1997) 3 6:1846-1851). It has now 
been found that when the DEBS 1 -TE protein is fully 

10 purified from extracts of recombinant S. erythraea it 

contains no such specific decarboxylase activity 
(Weissmann, K. et al . (1998) Biochemistry, in the press), 
further confirming that starter units do not in fact 
arise from decarboxylation of extension units mediated 

15 by the KS of extension module 1 . 

It is known that the DEBS loading module has a 
slightly broader specificity than propionate only, and in 
particular acetate starter units are used both in vitro 
and in vivo, when the PKS containing this loading module 

20 is part of a PKS that is expressed either in S. erythraea 

the natural host for erythromycin production (see for 
example Cortes, J. et al. Science (1995) 268:14 87-14 8 9), 
or in a heterologous host such as S. coellcolor (Kao, C. 
M. et al. J. Am. Chem. Soc. (1994) 116:11612-11613; 
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Brown, M . J. B . et al. J. Chem. Soc. Chem. Commun . (1995) 
1517-1519) . In vitro experiments using purified DEBS 1 -TE 
have demonstrated that propionyl-CoA and acetyl-CoA are 
alternative substrates that efficiently supply propionate 
and acetate units respectively to the loading module 
(Wiessmann, K . E. H. et al. Chemistry and Biology (1995) 
2:583-589; Pieper, R. et al. J. Am. Chem. Soc. (1995) 
117:11373-11374). The outcome of the competition between 
acetate and propionate starter units is influenced by the 
respective intracellular concentrations of propionyl-CoA 
and acetyl-CoA prevailing in the host cell used (see for 
example Kao, C. M. et al. Science (1994) 265:509-512; 
Pereda, A. et al. Microbiology (1995) 14 4:543-553). It 
is also determined by the level of expression of the host 
PKS, so that as disclosed for example in Pending 
International Patent Application number PCT/GB97 / 0 1 8 1 8 , 
when recombinant DEBS or another hybrid PKS containing 
the DEBS loading module is over-expressed in S. 
erythraea, the products are generally mixtures whose 
components differ only in the presence of either an 
acetate or a propionate starter unit . 

There is a need to develop reliable and methods for 
avoiding the formation of mixtures of polyketides with 
both acetate and propionate starter units, and to allow 
the specific incorporation of unusual starter units. It 
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has now been found, surprisingly, that the role of the 
loading domains in the PKSs for the 16-membered 
macrolides tylosin, niddamycin and spiramycin is 
dif f e rent from that of the loading domains of the 
avermectin PKS and of DEBS. It has been realised that 
5 the KSq domain of the tylosin PKS and the associated AT 

domain, which is named here ATq, together are responsible 
for the highly specific production of propionate starter 
units because the ATq is specific for the loading of 
methylmalonyl-CoA and not pr opionyl -CoA as previously 

10 thought; and the KSq is responsible for the highly 

specific decarboxylation of the enzyme-bound 
methylmalonate unit to form propionate unit attached to 
the ACP domain of the loading module and appropriately 
placed to be transferred to the KS of extension module 1 

15 for the initiation of chain extension. In a like manner 

the ATq of the spiramycin and niddamycin PKSs, and the 
adjacent KSq, are responsible for the specific loading of 
malonate units rather than acetate units as previously 
believed, and for their subsequent specific 

20 decarboxylation to provide acetate starter units for 

polyketrde chain extension. 

It has also now been found here that not only the 
PKSs for the above-mentioned 16-membered macrolides, but 
also the PKSs for certain 14-membered macrolides 
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particularly the oleandomycin PKS from Streptomyces 
antibioticus (Figure 4) and also the PKSs for certain 
polyether ionophore polyketides particularly the putative 
monensin PKS from Streptomyces cinnamonensis (Figure 4), 
possess a loading domain comprising a KSq domain, an ATq 



5 domain, and an ACP . In Figure 4 is shown a sequence 

alignment of the KSq domains and of the adjacent linked 
ATq domains that have been identified, showing the 
conserved active site glutamine (Q) residue in the KSq 
domains, and an arginine residue which is conserved in 
10 all extension AT domains and is also completely conserved 

in ATq domains. This residue is characteristically not 
arginine in the AT domains of either DEBS or of the 
avermectin PKS loading modules, where the substrate for 
the AT is a non-carboxylated acyl-CoA ester (Haydock, S. 
15 F. et al. FEBS Letters (1995) 374:246-248) . The 

abbreviation ATq is used here to simply to distinguish 
the AT domains found immediately C-terminal of Ksq from 
extension ATs, and the label has no other significance. 
In one aspect the invention provides a PKS 
20 multienzyme or part thereof, or nucleic acid (generally 

DNA) encoding it, said multienzyme or part comprising a 
loading module and a plurality of extension modules, 
wherein 

(a) the loading module is adapted to load a malonyl 
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or substituted malcnyl residue and then to effect 
decarboxylation of the loaded residue to provide an 
acetyl or substituted acetyl (which term encompasses 
propionyl) residue for transfer to an extension module; 
and 

5 (b) the extension modules, or at least one thereof 

(preferably at least the one adjacent the loading 
module), are not naturally associated with a loading 
module that effects decarboxylation of an optionally 
substituted malonyl residue. 
10 Generally the loading module will also include an 

ACP (acyl carrier protein) domain. 

Preferably the deca rboxyla t ing functionality of the 
loading module is provided by a KS (ketosynthase) -type 
domain. Suitably this differs from a KS of a 
15 conventional extension module by possessing a glutamine 

residue in place of the essential cysteine residue in the 
active site. It is termed Ksq. It may be "natural" or 
genetically engineered, e.g. resulting from site-directed 
mutagenesis of nucleic acid encoding a different KS such 
20 as a KS of an extension module. 

Alternatively the uecarboxylat ing functionality can 
be provided by a CLF-type domain of the general type 
occuring in Type II PKS systems. 

Preferably the loading functionality is provided by 
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an AT ( acyl trans f erase ) -type domain which resembles an AT 
domain of a conventional extension module in having an 
arginine residue in the active site, which is not the 
case with the AT domains of loader modules which load 
acetate or propionate, e.g. in DEBS or avermectin PKS 



systems. It may be termed Atq. Once again, it may be 
"natural" or genetically engineered, e.g. by mutagenesis 
of an AT of an extension module. 

Usually the loading module will be of the form: 

Ksq-ATq-ACP 

where ACP is acyl carrier protein. 

In another aspect the invention provides a method of 
synthesising a polyketide having substantially 
exclusively a desired starter unit by providing a PKS 
multienzyme incorporating a loading module as defined 
above which specifically provides the desired starter 
unit. This may comprise providing nucleic acid encoding 
the multienzyme and introducing it into an organism where 
it can be expressed. 

In further aspects the invention provides vectors 
and transformant organisms and cultures containing 
nucleic acid encoding the multienzyme. A preferred 
embodiment is a culture which produces a polyketide 
having a desired starter unit characterised by the 
substantial absence of polyketides with different starter 
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units. Thus, for example, erythromycin can be produced 
substantially free from analogues resulting from the 
incorporation of acetate starter units in place of 
propionate . 

Preferably the hybrid PKS encodes a loading module 
5 and from 2 to 7 extension modules and a chain terminating 

enzyme (generally a thioes terase ) . 

It is particularly useful to provide a loading 
module of the type KSq - ATq-ACP for a PKS gene assembly 
which produces a 12-, 14- or 16-membered macrolide in 

10 order to prepare a 12-, 14- or 16-membered macrolide 

which contains exclusively or almost exclusively an 
acetate starter unit, even when such PKS gene assembly is 
expressed at high levels in an actinomycete host cell . 
Particularly suitable PKSs for this purpose are the 

15 components of PKSs for the biosynthesis of erythromycin, 

methymycin, oleandomycin, tylosin, spiramycin, 
midecamycin, and niddamycin for all of which the gene and 
modular organisation is known at least in part. 
Particularly suitable sources of the genes encoding a 

20 loading module of the type KSq - ATq-ACP are the loading 

modules of oleandomycin, spiramycin, niddamycin , 
methymycin and monensin which are specific for the 
loading of malonate units which are then decarboxyla ted 
to acetate starter units. 
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Similarly it is particularly useful to provide a 
loading module of the type KSq - ATq-ACP for a PKS gene 
assembly which produces the macrolides rifamycin, 
avermectin, rapamycin, immunomycin and FK506 whose 
loading modules possess an unusual specificity (and for 
all of which the gene and modular organisation is known 
at least in part) in order to prepare such macrolides 
containing exclusively or almost exclusively an acetate 
starter unit, even when such PKS gene assembly is 
expressed at high levels in an actinomycete host cell. 
Particularly suitable sources of the genes encoding a 
loading module of the type KSq - ATq-ACP are the loading 
modules of oleandomycin, spiramycin, niddamycin, 
methymycin and monensin which are specific for the 
loading of malonate units which are decarboxylated to 
acetate starter units. 

It is similarly useful to provide a loading module 
of the type KSq - ATq-ACP for a PKS gene assembly which 
produces a 12-, 14- or 16-membered macrolide in order to 
prepare a 12-, 14- or 16-membered macrolide which 
contains exclusively or almost exclusively a propionate 
starter unit, even when such PKS gene assembly is 
expressed at high levels in an actinomycete host cell. 
Particularly suitable PKSs for this purpose are the 
components of PKSs for the biosynthesis of erythromycin, 
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methymycin, oleandomycin, tylosin, spiramycin, 
midecamycin, and niddamycin for all of which the gene and 
modular organisation is known at least in part. A 
particularly suitable source of the genes encoding a 
loading module of the type KSq-ATq-ACP is the loading 
5 module of tylosin which is specific for the loading of 

methylmalona te units which are decarboxylated to 
propionate starter units. 

Similarly it is particularly useful to provide a 
loading module of the type KSq - ATq-ACP for a PKS gene 
10 assembly which produces the macrolides rifamycin, 

avermectin, rapamycin, immunomycin and FK506 whose 
loading modules possess an unusual specificity (and for 
all of which the gene and modular organisation is known 
at least in part) in order to prepare such macrolides 
15 containing exclusively or almost exclusively a propionate 

starter unit, even when such PKS gene assembly is 
expressed at high levels in an actinomycete host cell. A 
particularly suitable source of the genes encoding a 
loading module of the type KSq - ATq-ACP is the loading 
20 module of tylosin which is specific for the loading of 

me thylmalona te units wrirch are decarboxylated to 
propionate starter units . 

In the loading module of the type KSq - ATq-ACP the 
domains or portions of them may be derived from the same 
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or from different sources, and comprise either natural or 
engineered domains. For example the ATq domain can be 
replaced by an AT domain derived from any extension 
module of a Type I PKS, having specificity either for 
loading of malonate units or for loading of 



methylmalonate units respectively, so long as the KSq 
domain is chosen to have a matching specificity towards 
either methylmalonate or malonate units respectively. 

Alternatively, the KSq domain in the loading module 
provided of the type KSq - ATq-ACP may be substituted by 
the CLF polypeptide of a Type II PKS. It is now apparent 
that in contrast to its previous identification as a 
factor uniquely determining chain length, the CLF, in 
addition to any other activities that it may possess, is 
the analogue of the KSq domain and can act as a 
decarboxylase towards bound malonate units. 

The loading module of the type KSq - ATq-ACP may be 
linked to a hybrid PKS produced for example as in 
PCT/GB97/01818 and PCT/GB97 / 0 18 1 0 . It is particularly 
useful to link such a loading module to gene assemblies 
that encode hybrid PKSs that produce novel derivatives of 
14-membered macrolides as described for example in 
PCT/GB97/01818 and PCT/GB97 /018 1 0 . 

The invention further provides such PKS assemblies 
furnished with a loading module of the type KSq - ATq- 



- 24 - 

ACF, vectors containing such assemblies, and transfcrmant 
organisms that can express them. Transformant organisms 
may harbour recombinant plasmids, or the plasmids may 
integrate. A plasmid with an int sequence will integrate 
into a specific attachment site (att) of the host's 
5 chromosome. Transformant organisms may be capable of 

modifying the initial products, eg by carrying out all or 
some of the biosynthetic modifications normal in the 
production of erythromycins (as shown in Figure 5) and 
for other polyketides. Use may be made of mutant 

10 organisms such that some of the normal pathways are 

blocked, e.g. to produce products withut one or more 
"natural" hydroxy-groups or sugar groups. The invention 
further provides novel polyketides as producible, 
directly or indirectly, by transformant organisms. This 

15 includes polyketides which have undergone enzymatic 

modification . 

In a further aspect the invention provides both 
previously-obtained polyketides and novel polyketides in 
a purer form with respect to the nature of the starter 

20 unit, than was hitherto possible. These include 12- , 

14- and 16-membered ring macrolides which are either 
"natural'' or may differ from the corresponding "natural" 
compound : 
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a) in the oxidation state of one or more of the ketide 
units (ie selection of alternatives from the group: -CO-, 
-CH(OH)-, alkene -CH-, and -CH 2 - ) where the 
stereochemistry of any -CH(OH)- is also independently 
selectable; 



b) in the absence of a "natural" methyl side-chain; or 

c) in the stereochemistry of "natural" methyl; and/or 
ring substituents other than methyl. 

10 it is also possible to prepare derivatives of 12-, 

14- and 16-membered ring macrolides having the 
differences from the natural product identified in two or 
more of items a) to c) above. 

Derivatives of any of the afore-mentioned 

15 polyketides which have undergone further processing by 

non-PKS enzymes, eg one or more of hydroxylation , 
epoxidation, glycosylat ion and methylation may also be 
prepared . 

The present invention provides a novel method of 
20 obtaining both known and novel complex polyketides 

without the formation of mixtures of products differing 
only in having either an acetate or a propionate starter 
unit. In addition the present invention provides a 
method to obtain novel polyketides in which the starter 



- 26 - 

unit is an unusual starter unit which is derived by the 
action of a KSq domain on the enzyme-bound product of an 
AT of unusual specificity derived from an extension 
module of a natural Type I PKS. In particular the AT of 
extension module 4 of the FK506 PKS gene cluster 
5 preferentially incorporates an allyl side-chain; the AT 

of extension module 6 of the niddamycin PKS gene cluster 
preferentially incorporates a sidechain of structure 
HOCH 2 -; and the ATs of extension module 5 of spiramycin 
and of extension module 5 of monensin incorporate an 

10 ethyl side chain. In each case the KSq domain is 

preferentially one that is naturally propionate-specif ic , 
Alternatively, any KS from an extension module of a Type 
I PKS may be converted into a KSq domain capable of 
decarboxylating a bound carboxylated acyl thioester, by 

15 site-directed mutagenesis of the active site cysteine 

residue to replace it by another residue, preferably 
glutamine. It is known that the animal fatty acid 
synthase, which shares many mechanistic features with 
Type I PKS, in the absence of acetyl-CoA, has a 

20 demonstrable malonyl-CoA decarboxylase activity (Kresze, 

G. B. ct al. Eur. J. Biochem. (1977) 79:191-199). When 
treated with an alkylating agent such as iodoacetamide 
the fatty acid synthase is inactivated by specific 
modification of the active site cysteine of the KS , and 
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the resulting protein has an enhanced malonyl-CoA 
decarboxylase activity. The conversion of a fatty acid 
KS domain into a decarboxylase mirrors the genetically- 
determined change between the KS domains and the KSq 
domain in Type I PKSs. Indeed, the size and polarity 
characteristics of a glutamine side chain very closely 
approximate those of carboxamido-cysteine . The KSq to be 
used for decarboxylation of an unusual alkylmalonate unit 
is preferably selected from the same extension module of 
the same Type I PKS that provides the unusual AT, in 
order to optimise the decarboxylation of the unusual 
alkylmalonate, and the ACP to be used is preferably also 
the ACP of the same extension module. 

Suitable plasmid vectors and genetically engineered 
cells suitable for expression of PKS genes incorporating 
an altered loading module are those described in 
PCT/GB97/01818 as being suitable for expression of hybrid 
PKS genes of Type I. Examples of effective hosts are 
Saccharopolyspora erythraea, Streptomyces coelicolor, 
Streptomyces avermitilis, Streptomyces gr iseofuscus , 
Streptomyces cinnamonensis , Streptomyces fradiae, 
Streptomyces longisporo flaws , Streptomyces 
hygroscopicus, Micromonospora griseorubida, Streptomyces 
lasaliensis, Streptomyces venezuelae, Streptomyces 
antibiot.cus, Streptomyces lividans, Streptomyces 
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rimcsus, Streptomyces albus, Amycolatops is mediterranei , 
and Streptomyces tsukubaens is . These include hosts in 
which SCP2 * -derived plasmids are known to replicate 
autonomously, such as for example S. coelicolor , S. 
avermit il is and 5. griseofuscus; and other hosts such as 
5 Saccharopolyspora erythraea in which SCP2 * -derived 

plasmids become integrated into the chromosome through 
homologous recombination between sequences on the plasmid 
insert and on the chromosome; and all such vectors which 
are integrati vely transformed by suicide plasmid vectors. 
10 Some embodiments of the invention will now be 

described with reference to the accompanying drawings in 
which : 

Fig 1 is a diagram showing the functioning of 6- 
deoxyerythronolide B synthase (DEBS), a modular PKS 

15 producing 6-deoxyerythronol ide B (6-DEB) a precursor of 

erythromycin A. 

Fig 2 gives the amino acid sequence comparison of 
the KS domains and the CLF domains of representative Type 
II PKS gene clusters. The active site Cysteine (C) of 

20 the KS domains is arrowed in the Figure and aligns with 

the Glutamine (Q) or glutamic acid (E) of the CLF 
domains. The abbreviations used, and the relevant 
Genbank/EMBL accession numbers are: GRA: granaticin from 
Streptomyces v iolaceoruber (X63449); HIR: unknown 
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pclyketide from Saccharopolyspora hirsuta (M98258); ACT, 
actmorhodm from Streptomyces coelicolor (X63449); CIN: 
unknown polyketide from Streptomyces cinnamonensis 
(Z1151D; VNZ: jadomycin from Streptomyces venezuelae 
(L33245) ; NOG: anthracycline s from Streptomyces nogalater 
5 (Z48262); TCM: tet racenomycin from S. glaucescens 

(M80674); DAU : daunomycin from Streptomyces sp. C5 
(L34880) ; PEU, doxorubicin from Streptomyces peucetius 
(L35560) ; WHI : WhiE spore pigment from Streptomyces 
coelicolor (X55942) . 
10 Fig 3 shows the gene organisation of the PKSs for 

three 16-membered ring macrolides, tylosin, spiramycin 
and niddamycin. 

Fig 4 shows the amino acid sequence alignment of 
KSq-ATq loading didomains of the PKSs for niddamycin, 
15 platenolide (spiramycin) , monensin, oleandomycin and 

tylosin. The sequences for the monensin and oleandomycin 
loading didomains have not been previously. disclosed. 

Fig. 5 The enyzmatic steps that convert 6- 
deoxyerythronolide B into erythromycin A in 
20 Saccharopolyspora erythraea 

The present invention will now be illustrated, but 
is not intended to be limited, by means of some examples. 
Example 1 

Construction of the Recombina nt Vector pPEX43 
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Plasmid pCJR24 was prepared as described in 
PCT/GB97/01819. pMDITE is a pCJR24-based plasmid 
containing the gene encoding a hybrid polyketide synthase 
that contains the putative monensm PKS loading module 
(isolated from 5. cinnamonensls) the DEBS extension 
5 modules 1 and 2 and the chain- terminating thioes terase . 

Plasmid pPFL43 was constructed as follows: 

The following synthetic oligonucleotides: 5'- 
CCATATGGCCGCATCCGCGTCAGCGT-3 ' and 5'- 

10 GGCTAGCGGGTCCTCGTCCGTGCCGAGGTCA-3 ' 

are used to amplify the DNA encoding the putative 
monensin-producing loading module using a cosmid that 
contains the 5' end of the putative monensin-producing 
PKS genes from S. cinnamonensis or chromosomal DNA of S. 

15 cinnamonensls as template. The PCR product of 3.3 kbp is 

purified by gel electrophoresis, treated with T4 
polynucleotide kinase and ligated to plasmid pUC18, which 
has been linearised by digestion with Sma I and then 
treated with alkaline phosphatase. The ligation mixture 

20 was used to transform eiectrocompetent E.coll DH10B cells 

and individual clones were checked for the desired 
plasmid pPFL40. Plasmid pPFL40 was identified by 
restriction pattern and sequence analysis. 

Plasmid pPFL40 was digested with AZde I and Nhe I and 
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the 3.3 kbp fragment was purified by gel electrophoresis 
and ligated to PP ND30-His previously digested with Nde I 
and Nhe I and treated with alkaline phosphatase. The 
ligation mixture was used to transform electrocompetent 
E.coli DH10B cells an d individual clones were checked for 
5 the desired plasmid P PFL43. Plasmid pPFL43 was 

identified by restriction analysis. 
Example __2 

r^^mction ^ S. ervthraea JC2 / pPFL43 

Plasmid pPFL43 was used to transform S.erythraea JC2 
10 protoplasts. The construction of strain JC2 from which 

the resident DEBS genes are substantially deleted is 
given in Pending Patent Application PCT/GB97 /01818 . 
Thiostrepton resistant colonies were selected in R2T20 
medium containing 10 g/ml of thiostrepton. Several 
15 clones were tested for the presence of P PFL43 integrated 

into the chromosome by Southern blot hybridisation of 
their genomic DNA with DIG-labelled DNA containing the 
mon PKS fragment encoding for the loading module. 
Example 3 

20 frH:-^- poWlcgtide* *• erythr*** JC2/pPFL43 

A frozen suspension of strain S. erythraea JC2 / 
pPFL4 3 was inoculated in eryP medium, which has the 
following composition, 
pryP medium 
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dextrose 50 g per 1 
nutrisoy flour 30 g per 1 
(NH4 ) 2 SO, 3 g per 1 
CaC0 5 6 g per 1 

NaCl 5 g per 1 



5 pH = 7 . 0 

and containing 5 g/ml of thiostrepton . The 
inoculated culture was allowed to grow for seven days at 
28-30°C. After this time the broth was filtered to 
remove mycelia and the pH adjusted to pH=3 . 0 . The broth 

10 was extracted twice with two volumes of ethyl acetate and 

the combined extracts were washed with an equal volume of 
saturated sodium chloride, dried over anhydrous sodium 
sulphate, and the ethyl acetate was removed under reduced 
pressure, to give crude product. The product was shown 

15 to have the structure shown below, and by MS, GC-MS and 

X H NMR was found to be identical to an authentic sample. 



OH 



33 - 



10 



15 



20 



Example A 

r.n g ^ 1C tior »+■ S- «rvth™-« NBTU.?338/ P MD1TE 

Plasmid P PFL43 was used to transform S. erythraea 
NRRL2338 protoplasts. Thiostrepton resistant colonies 
■ were selected m K^U medium co ntaining 1U g/ml ul 
thiostrepton. Several clones were tested for the 
presence of pPFL43 integrated into the chromosome by 
Southern blot hybridisation of their genomic DNA with 
DIG-labelled DNA containing the mon PKS fragment 
encoding for the loading module. A clone with an 
integrated copy of P PFL43 was selected in this way. 
Exam ple 5 

Production PQ lvket. irle* u sing S er ythraea NRRL 
7 33B/ pPFL43 

A frozen suspension of strain S. erythraea/pMDITE 
was used to inoculate eryP medium containing 5 g/ml of 
thiostrepton and allowed to grow for seven days at 28- 
30°C. After this time the broth was filtered to remove 
mycelia and the P H adjusted to P H=9 . The supernatant was 
then extracted three times with an equal volume of ethyl 

acetate and the solvent was removed by evaporation. 

Products were analysed by HPLC/MS and one macrolide was 

identified with the structure shown, and identical by MS, 

GC-MS, and MS-MS with authentic material: 





Construction of the Recombinant Vector pPFIi42 

Plasmid pPFL42 is a pCJR24 -based plasmid containing 
5 the gene encoding a hybrid polyketide synthase that 

contains the t ylosin-producing PKS loading module, the 
erythromycin extension modules 1 and 2 and the chain- 
terminating thioesterase . Plasmid pPFL42 was constructed 
as follows : 



10 



10 



15 



20 



- 35 - 

The following synthetic oligonucleotides: 

5 • -CCATATGACCTCGAACACCGCTGCACAGAA-3 1 and 

5 • -GGCTAGCGGCTCCTGGGCTTCGAAGCTCTTCT-3 ' 

wer e used to amplify the DNA encoding the tylosin- 

producing loading module using either cos6T (a cosmid 



fradiae) or chromosomal DNA from S. fradiae as template. 
The PGR product of 3.3 kbp was purified by gel 
electrophoresis, treated with T4 polynucleotide kinase 
and ligated to plasmid P UC18, which had been linearised 
by digestion with Stna I and then treated with alkaline 
phosphatase. The ligation mixture was used to transform 
electrocompetent E.coli DH10B cells and individual clones 
were checked for the desired plasmid P PFL39. Plasmid 
pPFL 39 was identified by restriction and sequence 
analysis . 

Plasmid PPFL39 was digested with Nde I and Nhe I and 
the 3.3 kbp fragment was purified by gel electrophoresis 
and ligated to P ND30 previously digested with Nde I and 
Nhe i and treated with alkal.ne phosphatase. The ligation 
mi xture was used to transform electrocompetent E.coli 
DH10B cells and individual clones were checked for the 

desired plasmid P PFL42. Plasmid P PFL42 was identified by 

restriction analysis. 

Bvample 7 
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Construction of S. erythraea JC2/pPFL42 

Plasmid pPFL42 was used to transform S. erythraea 
JC2 protoplasts . Thios t repton resistant colonies were 
selected in R2T20 medium containing 10 g/ml of 
thios t repton . Several clones were tested for the presence 

5 Of p P FL42 integrated into the c hr omos ome b y Southern blot 

hybridisation of their genomic DNA with DIG-labelled DNA 
containing the tyl PKS fragment encoding for the loading 
module. A clone with an integrated copy of pPFL42 was 
identified in this way, 

10 

Example 8 

Production of polyke tides using S. erythraea NRRL 
2338/pTDlTE 

A frozen suspension of strain 5. erythraea NRRL 
15 2338/pPFL42 was used to inoculate eryP medium containing 

5 g/ml of thiostrepton and allowed to grow for seven 
days at 28-30°C. After this time the broth was filtered 
to remove mycelia and the pH adjusted to pH=3 . The broth 
was extracted twice with two volumes of ethyl acetate and 
20 the combined extracts were washed with an equal volume of 

saturated sodium chloride, dried over anhydrous sodium 
sulphate, and the ethyl acetate was removed under reduced 
pressure, to give crude product. The product was shown 
to have the structure shown below, and was identical, as 




10 



Example 9 

r »™ c fcior ft. ^rvtlir^w NBPT.?338/pPFL42 

Plasmid pPFL42 was used to transform S. erythraea 
NRRL2338 protoplasts. Thiostrepton resistant colonies 
were selected in R2T20 medium containing 10 g/ml of 
thiostrepton. Several clones were tested for the presence 
of P PFL42 integrated into the chromosome by Southern blot 
hybridrsation of their genomic DNA with DIG-labelled DNA 
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containing the tyl PKS fragment encoding for the loading 
module. A clone with an integrated copy of pPFL42 was 
identified in this way. 



Example 10 



5 Production of polyke tides using S. exythraea. 

NRRL2 338/pPFL42 

A frozen suspension of strain S. erythraea 
NRRL2 338 /pTDlTE was used to inoculate eryP medium 
containing 5 g/ml of thiostrepton and allowed to grow 

10 for seven days at 28-30°C. After this time the broth was 

filtered to remove mycelia and the pH adjusted to pH=9. 
The supernatant was then extracted three times with an 
equal volume of ethyl acetate and the solvent was removed 
by evaporation. Products were analysed by HPLC/MS and a 

15 macrolide was identified with the following structure, 

identical with that of authentic erythromycin A (together 
with other products, which were identified as the 
corresponding erythromycins B and D, the result of 
incomplete post-PKS processing) : 
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Example 11 

Construction of plasmid pPFL35 

Plasmid pPFL35 is a pCJR24-based plasmid containing a 
PKS gene comprising a loading module, the first and second 
extension modules of DEBS and the chain terminating 
5 thioesterase . The loading module comprises the KSq domain 

DNA from the loading module of the oleandomycin PKS fused 
to the malonyl-CoA-specific AT of module 2 of the rapamycin 
PKS, in turn linked to the DEBS loading domain ACP. Plasmid 
pPFL35 was constructed via several intermediate plasmids as 

10 follows: 

A 411 bp DNA segment of the eryAI gene from 
S.erythraea extending from nucleotide 1279 to nucleotide 
1690 (Donadio, S. et al . , Science (1991) 2523:675-67 9) was 
amplified by PCR using the following synthetic 

15 oligonucleotide primers :- 

5 ' -TGGACCGCCGCCAATTGCCTAGGCGGGCCGAACCCGGCT-3 • and 
5 ' -CCTGCAGGCCATCGCGACGACCGCGACCGGTTCGCC-3 ' 

The DNA from a plasmid designated pKSW, derived from 
pT7-7 and DEBS1-TE in which new Pst I and Hindi I I sites had 

20 been introduced to flank the KS1 of the first extension 

module, was used as a template. The 441 bp PCR product was 
treated with T4 polynucleotide kinase and ligated to 
plasmid pUC18, which had been linearised by digestion with 
Sma I and then treated with alkaline phosphatase. The 
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ligation mixture was used to transform elect rocompetent 
E.coli DH10B cells and individual clones were checked for 
the desired plasmid, pPFL26. The new Mfe I/Avr II sites 
bordering the insert are adjacent to the Eco RI site in the 
polylinker of pUC18. Plasmid pPFL26 was identified by 
5 restriction pattern and sequence analysis . 

An Mfe I restriction site is located 112 bp from the 
5 1 end of the DNA encoding the propionyl-CoA : ACP 
transferase of the loading module of DEBS. Plasmid pKSW was 
digested with Mfe I and Pst I and ligated with the 411 bp 
10 insert obtained by digesting plasmid pPFL26 with Mfe I and 

Pst I. The ligation mixture was used to transform 
electrocompetent E.coli DH10B cells and individual clones 
were checked for the desired plasmid, pPFL27 . Plasmid 
pPFL27 contains a PKS gene comprising the DEBS loading 
15 module, the first and second extension modules of DEBS and 

the DEBS chain terminating thioes terase . Plasmid pPFL27 was 
identified by its restriction pattern. 

Plasmid pPFL27 was digested with Nde I and Avr II and 
ligated to a 4.6kbp insert derived from digesting plasmid 
20 pM06 (PCT/GB97/01818) with Nde I and Avr II. Plasmid pM06 

contains a PKS gene comprising the DEBS loading module, the 
first and second extension modules of DEBS and the DEBS 
chain terminating thioes terase , except that the DNA segment 
encoding the methylmalona te-specif ic AT within the first 
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extension module has been specifically substituted by th 
DNA encoding the malonate-speci f ic AT of module 2 of the 
rap PKS. The ligation mixture was used to transform 
electrocompetent E. coli DH10B cells and individual clones 
were checked for the desired plasmid, pPFL28. Plasmid 
5 pPFL28 contains a hybrid PKS gene comprising the DEBS 

loading module, the malonate-specif ic AT of module 2 of the 
rap PKS, the ACP of the DEBS loading module, followed by 
the first and second extension modules of DEBS and the DEBS 
chain terminating thioesterase . Plasmid pPFL28 was 
10 identified by restriction analysis. 

A DNA segment encoding the KSq domain from the oleAI 
gene of S. antlbiotlcus extending from nucleotide 1671 to 
nucleotide 3385 was amplified by PCR using the following 
synthetic oligonucleotide primers:- 
15 5 ' -CCACATATGCATGTCCCCGGCGAGGAA-3 ' and 

5 ' -CCCTGTCCGGAGAAGAGGAAGGCGAGGCCG- 3 ' 

and chromosomal DNA from Streptomyces antibioticus as a 
template. The PCR product was treated with T4 
polynucleotide kinase and ligated to plasmid pUC18, which 
20 had been linearised by digestion with Sma I and then 

treated with alkaline phosphatase. The ligation mixture was 
used to transform electrocompetent E. coli DH10B cells and 
individual clones were checked for the desired plasmid, 
P PFL31. The new Nde I site bordering the insert is adjacent 
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tc the Eco RI site of the pUC18 polylinker while the new 
Bsp EI site borders the Hin dill site of the linker region. 
Plasmid pPFL31 was identified by restriction and sequence 
analysis . 

Plasmid pPFL31 was digested with Nde I and A\rr II and 
5 the insert was ligated with plasmid pPFL28 that had been 

digested with Nde I and Avr II. The ligation mixture was 
used to transform electrocompetent E.coll DH10B cells and 
individual clones were checked for the desired plasmid, 
pPFL32 . Plasmid pPFL32 was identified by restriction 
10 analysis . 

Plasmid pPFL32 was digested with Nde I and Xba I and 
the insert was ligated to plasmid pCJR24, which had been 
digested with Nde I and Xba I and purified by gel 
electrophoresis. The ligation mixture was used to transform 
15 electrocompetent E.coli DH10B cells and individual clones 

were checked for the desired plasmid, pPFL35 . Plasmid 
pPFL35 was identified by restriction analysis. 



Example 12 

2 0 Construction of S . exrythraea JC2 / pPFL35 

Plasmid pPFL35 was used to transform S. erythraea JC2 
protoplasts. Thiostrepton resistant colonies were selected 
in R2T20 medium containing 10 g/ml of thiostrepton. 
Several clones were tested for the presence of pPFL35 
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integrated into the chromosome by Southern blot 
hybridisation of their genomic DNA with DIG-labelled DNA 
containing the rap PKS fragment encoding for module 2 
acyltransf erase. A clone with an integrated copy of P PFL35 
was identified in this way. 



10 



15 



Example 13 

Pmduction of polvketides using S. eryt hrava JC2 / P PFL35 

A frozen suspension of strain S. erythraea JC2 / 
P PFL35 was used to inoculate eryP medium containing 5 g/ml 
of thiostrepton and allowed to grow for seven days at 28- 
30°C. After this time the broth was filtered to remove 
mycelia and the pH adjusted to P H=3 . The broth was 
extracted twice with two volumes of ethyl acetate and the 
combined extracts were washed with an equal volume of 
saturated sodium chloride, dried over anhydrous sodium 
sulphate, and the ethyl acetate was removed under reduced 
pressure, to give crude product. The product was shown to 
have the following structure, and was found by MS, GC-MS 
and X H NMR to be identical to authentic material: 



OH 



20 
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Example 14 

Construction of S. erythraea NRRL2338/pPFL35 

Plasmid pPFL35 was used to transform S.erythraea 
NRRL2338 protoplasts. Thiostrepton resistant colonies were 
selected in R2T20 medium (Yamamoto et al.) containing 10 
5 g/ml of thiostrepton. Several clones were tested for the 

presence of pPFL35 integrated into the chromosome by 
Southern blot hybridisation of their genomic DNA with DIG- 
labelled DNA containing the rap PKS fragment encoding for 
module 2 AT . A clone with an integrated copy of pPFL35 was 
10 identified in this way. 



Example 15 

Production of polyketides using S. eirvthraea 

15 NRRL2 338 /pPFL35 

A frozen suspension of strain S. erythraea 
NRRL2338/pPFL35 was used to inoculate eryP medium 
containing 5 g/ml of thiostrepton and allowed to grow for 
seven days at 28-30°C. After this time the broth was 
20 filtered to remove mycelia and the pH was adjusted to pH=9. 

The supernatant was then extracted three times with an 
equal volume of ethyl acetate and the solvent was removed 
by evaporation. Products were analysed by HPLC/MS and the 
following macrolide was identified, and its structure as 
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determined by MS and : H NMR was found to be identical to 
that of authentic material (it was accompanied by the 
corresponding erythromycins B and D, the products of 
incomplete processing by post-PKS enzymes: 




Example 1 6 

mnstmcti nn of Recombinant Vector p PFL44 

Plasmid P PFL44 is a pCJR24- based plasmid containing 
10 the gene encoding a hybrid polyketide synthase that 

contains the spiramycin PKS loading module, the 

erythromycin extension modules 1 and 2 and the chain- 
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terminating thioesterase . Plasmid pPFL44 was constructed 
as follows : 



The following synthetic oligonucleotides: 

5 ' -CCATATGTCTGGAGAACTCGCGATTTCCCGCAGT-3 ' and 



5 5 ' -GGCTAGCGGGTCGTCGTCGTCCCGGCTG-3 ' 

were used to amplify the DNA encoding the spiramycin- 
producing loading module using chromosomal DNA from the 
spiramycin producer S. a/njbor~aciens prepared according to 
the method described by Hopwood et al . (1985). The PCR 

10 product of 3.3 kbp was purified by gel electrophoresis, 

treated with T4 polynucleotide kinase and ligated to 
plasmid pUCl 8 , which had been linearised by digestion with 
Sma 1 and then treated with alkaline phosphatase. The 
ligation mixture was used to transform elect rocompetent 

15 E.coli DH10B cells and individual clones were checked for 

the desired plasmid pPFL41. Plasmid pPFL41 was identified 
by restriction pattern and sequence analysis. 



Plasmid pPFL41 was digested with Nde I and Nhe I and the 
20 3.3 kbp fragment was purified by gel electrophoresis and 

ligated to pND30 ( a plasmid derived from plasmid pCJR24 
having as insert the ave PKS loading module and extension 
modules 1 and 2 or DEBS and the DEBS thioesterase) 
(PCTGB97/01810) previously digested with Nde I and Nhe I 
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and treated with alkaline phosphatase. The ligation 
mixture was used to transform electrocompetent E.coli DH10B 
cells and individual clones checked for the desired plasmid 
P PFL44. Plasmid pPFL44 was identified by restriction 
analysis. 



10 



15 



Example 17 

^ ^■^■M^ of S. ezrvthTvma J C2/ P SD1TE 

Plasmid P PFL44 was used to transform S . erythraea JC2 
protoplasts. Thiostrepton resistant colonies were selected 
in R2T20 medium containing 10 g/ml of thiostrepton. 
Several clones were tested for the presence of P PFL44 
integrated into the chromosome by Southern blot 
hybridisation of their genomic DNA with DIG-labelled DNA 
containing the srm PKS fragment encoding for the loading 
module. A clone with an integrated copy of P PFL44 was 
identified in this way. 



20 



Example 18 

Urn!... ■ im - r -i T *-"-" S - ervthr ar* JC2 /pPFL44 

A frozen suspension of strain S. erythraea JC2/ P PFL44 
was used to inoculate eryP medium containing 5 g/ml of 
thiostrepton and allowed to grow for seven days at 28-30°C. 
After this time the broth was filtered to remove mycelia 
and the P H adjusted to P H=3 . The broth was extracted twice 




with two volumes of ethyl acetate and the combined extracts 
were washed with an equal volume of saturated sodium 
chloride, dried over anhydrous sodium sulphate, and the 
ethyl acetate was removed under reduced pressure, to give 
crude product. The product was shown to have the structure 
5 shown below and by GC-MS and : H NMR analysis was identical 

to authentic material : 




10 Example 19 

Construction of S. ervthraea NRRL2338/pPFL44 

Plasmid pPFL44 was used to transform S . erythraea 
NRRL2338 protoplasts. Thiostrepton resistant colonies were 



10 



g/ml of 



selected in R2T20 medium containing 
thiostrepton. Several clones were tested for the presence 
of pPFL4 4 integrated into the chromosome by Southern blot 
hybridisation of their genomic DNA with DIG-labelled DNA 
containing the spiramycin PKS fragment encoding for the 
loading module. A clone with an integrated copy of pPFL44 
was grown was identified in this way. 



Example 20 

Prnriuction of po lvketides using erythraea 

NRRL2338/PPFL44 

A frozen suspension of strains, erythraea NRRL2338/ 
P PFL44 was used to inoculate eryP medium containing 5 g/ml 
of thiostrepton and allowed to grow for seven days at 
28-30° C. After this time the broth was filtered to remove 
mycelia and the pH adjusted to P H=9. The supernatant was 
then extracted three times with an equal volume of ethyl 
acetate and the solvent was removed by evaporation. 
Products were analysed by HPLC/MS and a macrolide was 
identified which had the structure shown below, and as 
judged by MS and 1H NMR was identical to authentic 
material . : 



- 50 - 




Although the present invention is illustrated by the 
examples listed above, they should not be regarded as 
limiting the scope of the invention. The above descriptions 
illustrate for the first time the construction of a Type I 
5 PKS gene assembly containing a wholly or partly 

heterologous KSq-containing loading module and its use to 
obtain polyketide products of utility as synthetic 
intermediates or as bioactive materials such as 
antibiotics. It will readily occur to the person skilled 
10 in the art that a wholly or partly heterologous KSq- 

containing loading module from other PKS gene sets could 
be used to replace the loading module of DEBS, or indeed 



10 
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into a quite different PKS gene assembly. It will also 
readily occur to the person skilled in the art that that 
the additional specificity provided by the more efficient 
discrimination made between rnethylmalonyl-CoA and malonyl- 
CoA by an ATq, followed by specific decarboxylation by a 
KSq, is preferable to the imperfect discrimination between 
propionyl-CoA and acetyl-CoA that is a feature of the DEBS 
loading module and of many other PKS loading modules, in 
that it maximises the production of a single product rather 
than a mixture differing from each other in the nature of 
the starter unit. The avoidance of such mixtures increases 
yields and avoids the need for tedious and difficult 
separation procedures. 
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^GANISATION OF THE TYLOSIN-PRODUCING POLYKETIDE SYNTHASE 




mm 



LD MODULE 1 MODULE 2 



tyCI 



MODULE 3 
tyGH 



MODULE 6 MODULE 7 
■ k FT^T^ — 

ty/G\\\ y 



MODULE 4 MODULE 5 



ORGANISATION OF THE SPIRAMYCIN-PRODUCING POLYKETIDE SYNTHASE 




LD 



MODULE 1 MODULE 2 „ 



MODULE 4 MODULE 5 i . . \ rrr> B^H 

KSS\SN\SSS.S\SV-t J -+ J srmGlv srmGV 

srmGlll 



MODULE 6 



MODULE 7 



srmGl 



MODU LE 3 

vzzzzzzzzzzfy 
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ORGANISATION OF THE NIDDAMYCIN-PRODUCING POLYKETIDE SYNTHASE 




nidAW 



m: malonyl transferase 
mm: methylmalonyl transferase 
em: ethylmalonyl transferase 
C2: unknown C2 unit transferase 



A II of 3) Alignment of KSq-ATq loading 
ESilJ. ' of modular poVetid* synthases 
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1 MAGHGDATAQ KAQDAEKSED GSDAIAVIGM 

niddamycm - MS GELA isRSDD RSDAVAWGM 

platenolide ~~~ ~_ MAAS ASASPSGPSA GPDPIAWGM 

monensin MHVPGEE NGHSIAIVGI 

oleandomycin--— SNTAAQNTGD QEDVDGPDST HGGE I AWGM 

t \^ X o s i n 

100 

'cRFPGAPCT A FFWQTT . SSfi APAWTAAPG RRR gTIDA 

ACRFPGAPGI AEFWKLLTDG RDAIGRDADG RRR . . . - - - ■ " * * AqyLDR 
armPGAPDP DAFWRLLSEG RSAVSTAPPE RRRADSGLHG P . . . GGYLDR 
IcSSSSp QEFWRLLADS ADALDEPPAG RFPTGSLSSP PAPRGGFLDS 
SCRLPGAAGV EEFWELLRSG RGMPTRQDDG TWRAA ™& 
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PESLRGEAAS 
PGSLRGEAVG 
PPTLARSRTG 
PRHLRGTRTS 
PGDLTGTDTG 
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YVLGTRGPSL 
YAYHLAGPSL 
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RVYCWRGVA 
PVYCWRGVA 
TVYCVIRGSA 
TVYCEILGSA 
RVYCVI KGGA 
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LHGTGT PVGD 
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LHGTGTPAGD 
LHGTGTRAGD 
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FGMGGTNCHV VLEETPGG RQPAE . T 

FGIGGTNCHV VLEHLPSR PTPAV . S 

FGMGGTNGHV WA....AAP DSVAVPEPVG VPERVEVPEP VWSEPVWP 
FGMGGTNCHV VLSELRNAGG DGAGKGPYTG TEDRLGATEA EKRPDPATGN 
FGMGGTNVHL VIAEAPAAAG SSGAGGSGAG SGAGISAVSG W 
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GQADACLFSA SPMLLLSARS EQALRAQAAR LREHL . . EDS GADPLDIAYS 

VAAS LPD VPPLLLSARS EGALRAQAVR LGETV . . ERV GADPRDVAYS 

TPWP . . .VSAHS ASALRAQAGR LRTHLAAHRP TPDAARVGHA 

GPDPAQDTHR YPALILSARS DAALRAQAER LRHHL . EHSP GQRLRDTAYS 

PWVSGRS RWVREAAGR LAE..WEAG GVGLADVAVT 
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QHPGMGQELY 
QWVGMGRGLY 
QRLGMGRELY 
QQPGMGKRLH 
GAAGGAGAAG 
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AGQTPRGVRI GS . . TDADGR 
AGRVSGGVRS GR . .A.VPGG 
EGAETASIVR GEAYT . - EGR 
NGLPAPQVLL GRTPTPEPGG 
GGDPDAGWT G. . . AVLDGG 



DEVCEELQRC 
DEVLSMVGEV 
DEAFAALDVH 
DEVCAELDTH 
FPGQGTQWVG 



650 

GTQNLREVMF 
DGRSLRDVMF 
LDRPLREIVL 

LGRLL 

MGAGLLGSSE 



700 

LLDRTEYTQP ALFALQTALY 



651 

TPDQPD 

GDVDVDAGAG ADAGAGAGAG VGSGSGSVGG LLGRTEFAQP ALFALEVALF 

GETDSGGNVS GENVIGEGA DHQA LLDQTAYTQP ALFAIETSLY 

GPEAGPPLR DVMFAERGT AHSA LLSETHYTQA ALFALETALF 

VFAASMRECA RALSVHVGWD LLEWSGGAG . LERVDWQP VTWAVMVSLA 



701 

RTLTARGTQA 
RALEARGVEV 
RLAASFGLKP 
RLLVQWGLKP 
RYWQAMGVDV 



HLVLGHSVGE 
SWLGHSVGE 
DYVLGHSVGE 
DHLAGH SVG E 
AAWGHSQGE 



ITAAHIAGVL 
VAAATVAGVL 
TAAAHVAGVL 
IAAAHAAGIL 
IAAATVAGAL 



DLPDAARLIT 
SLGDAVRLW 
SL PDAS ALVA 
DLSDAAELVA 
SLEDAAAWA 



^ 750 

ARAHVMGQLP 
ARGGLMGGLP 
TRGRLMQAVR 
TRGALMRSLP 
LRAGLIGRYL 
f 



Fig. 4 continued 2 of 3 



niddam. . . 
platenol . 
monensin 
oleandom 
tylos in 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



751 

HG . GAMLSVQ 
VG - GGMWSVG 
AP . GAMAAWQ 
GG -GVMLSVQ 
AGRGAMAAVP 

801 

AQHLREQNVR 
VAS LMGDGVE 
TAAWRGRGRK 
EQ I LRDRGRK 
VAVCQAEGVQ 



AAEHDLDQLA 
ASESWRGW 
ATADEAAEQL 
APESEVAPLL 
LPAGEVEAGL 



HTWLKVSHAF 
YRRLDVSHGF 
AHHLKVSHAF 
SRYLRVSHAF 
ARLI PVDYAS 



HTHG . .VEIA 
EGLGEWVSVA 
AGHERHVTVA 
LGREAHVGLA 
. AKWPGVEVA 



HSALMDPMLG 
HSVLMEPVLG 
HSPHMDPILD 
HSPLMEPVLE 
HSRHVEDLKG 



AVNG PTHCVL 
AVNGPRSWL 
AVNGPDSVW 
AVNG PDAVW 
AVNGPASTW 



AFRDTLNTLN 
EFRGWESLE 
ELRAVAAGLT 
EFAEAVAGLT 
ELERVLSGI . 



800 

SGPRTALEET 
SGDVGVLESV 
SGDRATVDEL 
SGERGHVAAI 
SGDRRAVAGY 

850 

Y. .QPPTIPL 
FGRVRPGVW 
FHE . . PVIPV 
FRA. . PTTPL 
.RPRSPRVPV 



niddam. . . 
platenol . 
monensin 
oleandom 
tylos in 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



851 

ISNLTGQIA. 
VSGVSGGW . 
VSNVTGELVT 
VSNLTG .... 
CSTVAGEQPG 



900 

DPNHL CTPDYWIDHA RHTVRFADAV QTAHHQGTTT 

GSGEL GDPGYWVRHA REAVRFADGV GWRG LGVGT 

ATATGSGAGQ ADPEYWARHA REPVRFLSGV RGLCERGVTT 
. . APVDDRTM AT P A YWVRHV REAVRFGDG I RALGKLGTGS 
EPVF DAGYWFRNL RNRVE F SAW GGLLEEGHRR 



901 950 

YLEIGPHPTL TTLLHHTL . . . DNP T TIPTLHRERP 

LVEVGPHGVL TGMAGECLGA GDDV V WPAMRRGRA 

FVELGPDAPL SAMARDCFPA P ADRSRPRPA AIATCRRGRD 

FLEVGPDGVL TAMARACVTA APEPGHRGEQ GADADAHTAL LLPALRRGRD 
FIEVSAHPVL V HAIEQ TAEAADRSVH ATGTLRRQDD 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



951 

EPETLTQAIA 
EREVFEAALA 
EVATFLRSLA 
EARSLTEAVA 
SPHRLLTSTA 



AVGVRTDG I D 
TVFTRDAGLD 
QAYVRGADVD 
RLHLHGVPMD 
EAWAHG AT LT 



WAVLCGASRP 
ATALHTGSTG 
FTRAYGATAT 
WTSVLGGDVS 
WD PAL . . PPG 



RRVELPTYAF 
RRIDLPTTPF 
RRFPLPTYPF 
. RVPLPTYAF 
HLTTLPTYPF 



niddam: niddamycin; platenol: platenolide I (spiramycin); oleandom: 
oleandomycin . 
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